Asymptotic sieve for primes

By John Friedlander and Henryk Iwaniec 

1. Heuristics and statement of results 

For a long time it was believed that sieve methods might be incapable 
of achieving the goal for which they had been created, the detection of prime 
numbers. Indeed, it was shown [S], [B] to be inevitable that, in the sieve's 
original framework, no such result was possible although one could come
tantalizingly close. This general limitation is recognized as the "parity problem"
of sieve theory. 

More recently, beginning with the work [IJ], this goal has in certain cases
become possible by adapting the sieve machinery to enable it to take advantage 
of the input of additional analytic data. There have been a number of recent 
developments in this regard, for example [H], [DFI], [FI], and several recent 
works of R.C. Baker and G. Harman. The story however is far from finished. 

In this paper we consider a sequence of real nonnegative numbers 

(1.1) $\mathcal{A} = (a_n)$

with the purpose of showing an asymptotic formula for

(1.2) $S(x) = \sum_{n \le x} a_n \Lambda(n)$

where $\Lambda(n)$ denotes the von Mangoldt function. The sequence $\mathcal{A}$ can be
quite thin although not arbitrarily so. Setting

(1.3) $A(x) = \sum_{n \le x} a_n$

we require that $A(x)$ be slightly larger than $A(\sqrt{x})$; precisely

(1.4) $A(x) \gg A(\sqrt{x})(\log x)^2$.
It is easy to predict the main term for $S(x)$ by using the formula

$\Lambda(n) = -\sum_{d \mid n} \mu(d) \log d$.

Inserting this in (1.2) we arrange $S(x)$ as

$S(x) = \sum_{d \le x} \lambda_d A_d(x)$

where $\lambda_d = -\mu(d)\log d$ and $A_d(x)$ are the congruence sums

(1.5) $A_d(x) = \sum_{n \le x,\ n \equiv 0 \ (\mathrm{mod}\ d)} a_n$.
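The identity $\Lambda(n) = -\sum_{d \mid n} \mu(d)\log d$, on which the whole heuristic rests, is easy to confirm numerically. The following is a minimal Python sketch; the helper names `mobius`, `von_mangoldt` and `lambda_via_mobius` are illustrative choices and do not come from the paper.

```python
from math import log

def mobius(n):
    """Moebius function mu(n), computed by trial division."""
    result, p = 1, 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0  # p^2 divides the original n
            result = -result
        p += 1
    return -result if n > 1 else result

def von_mangoldt(n):
    """Lambda(n) = log p if n is a power of the prime p, and 0 otherwise."""
    if n < 2:
        return 0.0
    p = 2
    while p * p <= n:
        if n % p == 0:
            while n % p == 0:
                n //= p
            return log(p) if n == 1 else 0.0
        p += 1
    return log(n)  # no divisor up to sqrt(n): n itself is prime

def lambda_via_mobius(n):
    """Right-hand side of the identity: -sum_{d | n} mu(d) log d."""
    return -sum(mobius(d) * log(d) for d in range(1, n + 1) if n % d == 0)

# Compare both sides for small n; the difference is pure rounding error.
worst = max(abs(von_mangoldt(n) - lambda_via_mobius(n)) for n in range(1, 500))
print("largest discrepancy for n < 500:", worst)
```

Summing this identity against $a_n$ and interchanging the order of summation is exactly what produces the congruence sums $A_d(x)$.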

Hence the problem reduces to that of the asymptotic evaluation of the
sums $A_d(x)$. In practice it is not difficult to obtain a bound of the correct
order of magnitude with considerable uniformity. We assume that

(1.6) $A_d(x) \ll d^{-1} \tau(d)^8 A(x)$

uniformly in $d \le x^{1/3}$. Of course this bound is crude and we need more precise
formulas if we insist on asymptotic results. We assume that we can
approximate $A_d(x)$ by writing

(1.7) $A_d(x) = g(d)A(x) + r_d(x)$,

so $g$ may be regarded as a density function and $r_d(x)$ may be regarded as a
rather small remainder term. In the leading term we assume that $g$ is
multiplicative and satisfies

(1.8) $0 \le g(p) < 1, \quad g(p) \ll p^{-1}$

for all primes. Moreover, some regularity in the distribution of $g(p)$ is required.
We express this by assuming that for $y \ge 2$

(1.9) $\sum_{p \le y} g(p) = \log\log y + c + O((\log y)^{-10})$

with some constant $c$.
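To get a feeling for hypothesis (1.9), one can take the simplest density $g(p) = 1/p$ (an illustrative choice, not one singled out by the paper), for which (1.9) is the classical theorem of Mertens with $c$ the Meissel-Mertens constant, approximately 0.2615. A minimal Python sketch, with hypothetical helper names, tabulates the difference $\sum_{p \le y} 1/p - \log\log y$:

```python
from math import log

def primes_up_to(n):
    """Sieve of Eratosthenes returning all primes <= n."""
    sieve = bytearray([1]) * (n + 1)
    sieve[0:2] = b"\x00\x00"
    for p in range(2, int(n ** 0.5) + 1):
        if sieve[p]:
            # strike out the multiples p^2, p^2 + p, ... up to n
            sieve[p * p :: p] = bytearray(len(range(p * p, n + 1, p)))
    return [i for i in range(n + 1) if sieve[i]]

def mertens_defect(y):
    """sum_{p <= y} g(p) - log log y for the assumed density g(p) = 1/p."""
    return sum(1.0 / p for p in primes_up_to(y)) - log(log(y))

# The values should settle down to a constant c as y grows.
for y in (10**3, 10**4, 10**5, 10**6):
    print(y, mertens_defect(y))
```

For other densities $g$ the constant $c$ changes, but the shape of the check is the same.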

Now we can write by (1.7)

$S(x) = H(x)A(x) + R(x)$

where

(1.10) $H(x) = -\sum_{d \le x} \mu(d) g(d) \log d$

and $R(x)$ is the remainder term

(1.11) $R(x) = \sum_{d \le x} \lambda_d r_d(x)$.
The hypothesis (1.9) allows us (see (2.4)) to extend (1.10) to the complete
series, getting by partial summation

(1.12) $H(x) = H + O((\log x)^{-1})$

where $H$ is the positive constant

(1.13) $H = -\sum_{d} \mu(d) g(d) \log d$.

That this is positive follows since the series is also given by the infinite product

(1.14) $H = \prod_{p} (1 - g(p))(1 - 1/p)^{-1}$.

Hence we conclude that

(1.15) $S(x) = HA(x)\{1 + O((\log x)^{-1})\} + R(x)$.

This would finish the job if only one could ignore the remainder term $R(x)$,
but in general this is nothing but a wishful dream. A primary reason is that it
is impossible in practice to control the individual error terms $r_d(x)$ uniformly
for $d \le x$.

We assume that the error terms satisfy

(R) $\sum_{d \le D} \mu^2(d)\,|r_d(t)| \le A(x)(\log x)^{-2^{22}}$

for all $t \le x$ with some $D = D(x)$ such that

(R1) $x^{2/3} < D(x) < x$.

The assumption (R) is quite realistic since $D(x)$ is not necessarily required to
be near $x$. But such a choice of $D(x)$ necessitates some additional hypothesis
by means of which one would be able to avoid an encounter with the extremely
large moduli.

We assume an estimate for bilinear forms of the following type:

(B) $\sum_{m} \Bigl|\sum_{N < n \le 2N,\ mn \le x} \gamma(n)\mu(mn) a_{mn}\Bigr| \le A(x)(\log x)^{-2^{22}}$

for every $N$ with

(B1) $\Delta^{-1}\sqrt{D} < N < \delta^{-1}\sqrt{x}$

for some $\delta = \delta(x) \ge 2$ and $\Delta = \Delta(x) \ge 2$, and where the coefficients
are given by

(B2) $\gamma(n) = \gamma(n, C) = \sum_{d \mid n,\ d \le C} \mu(d)$

for any $C$ with

(B3) $1 \le C \le xD^{-1}$.

Remarks. Since $C$ is relatively small the coefficients $\gamma(n, C)$ behave more
or less as constant; indeed we have $\gamma(n, 1) = 1$. A thorough examination of
$\gamma(n, C)$ can be found in [DIT].
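The truncated divisor sums $\gamma(n, C) = \sum_{d \mid n,\ d \le C} \mu(d)$ of (B2) are simple to tabulate. The Python sketch below (function names are illustrative) confirms that $\gamma(n, 1) = 1$ for every $n$ and that the full sum, with $C \ge n > 1$, vanishes; in between, the values fluctuate.

```python
def mobius(n):
    """Moebius function mu(n), computed by trial division."""
    result, p = 1, 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0
            result = -result
        p += 1
    return -result if n > 1 else result

def gamma(n, C):
    """gamma(n, C): sum of mu(d) over the divisors d of n with d <= C."""
    return sum(mobius(d) for d in range(1, n + 1) if n % d == 0 and d <= C)

# gamma(30, C) as the truncation level C grows through the divisors of 30.
for C in (1, 2, 3, 5, 6, 10, 15, 30):
    print(C, gamma(30, C))
```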

Finally, for simplicity we assume that the sequence $\mathcal{A}$ is supported on
squarefree numbers, that is

(1.16) $a_n = 0$ if $\mu(n) = 0$.

In Section 9 we remove this assumption at the expense of strengthening slightly
the hypothesis (R) and introducing a few extra minor conditions.
Our main result is

Theorem 1. Assuming the above hypotheses, we have

(1.17) $S(x) = HA(x)\Bigl\{1 + O\Bigl(\frac{\log\delta(x)}{\log\Delta(x)}\Bigr)\Bigr\}$

where the implied constant depends only on the function $g$.

In practice (B) can be established in the range (B1) for $\delta = (\log x)^{\alpha}$ and
$\Delta = x^{\eta}$ with some positive constants $\alpha$, $\eta$. For these choices the error term
in (1.17) becomes $O(\log\log x/\log x)$ and this cannot be improved by refining
our arguments. If $\log\delta(x) \gg \log\Delta(x)$ then the error term exceeds the main
term and (1.17) follows from the upper bound sieve. Therefore for the proof
we may assume that $x \ge \Delta(x)^A$, $\Delta(x) \ge \delta(x)^A$, and $\delta(x) \ge A$ for any fixed
positive constant $A$. Note also that it suffices to prove the Theorem with $\Delta(x)$
in the inequality (B1) replaced by $\Delta(x)^A$ (and similarly for $\delta(x)$).

Most of the assumptions made above are standard ones from sieve theory.
We have tried to choose them so that they are not difficult to verify in practice
and have made no attempt to choose a minimal set. For example, some of
the information about the multiplicative function $g$ is stronger than necessary.
The assumption (1.9) is as strong as the prime number theorem; it could be
weakened considerably but it is not worth the effort. What needs to be said is
that (1.9) is not crucial for breaking the parity problem.

In the classical sieve machinery all of the above assumptions but (B) are
present in one form or another and, of these, (R) is the fundamental one.
As already mentioned, with (R) alone one cannot capture the primes, even
for $D(x) = x^{1-\varepsilon}$, which is the case treated by Bombieri [B]. It is clear that
$D(x) = x$ cannot happen in practice. Actually, for thin sequences $\mathcal{A}$ one cannot
expect (R) to hold with $D(x) > A(x)$ and for such sequences the classical sieve
does correspondingly worse.
By introducing into the sieve axioms the additional hypothesis (B) we
shall be able to resolve the "parity problem" and detect primes. Our theorem
gives conditions under which one can guarantee the expected number of primes
in the sequence $\mathcal{A}$. Note that in case $D(x) = x^{1-\varepsilon}$ very little is required in the
range (B1) of the bilinear forms, namely

$x^{1/2 - \varepsilon'} < N < x^{1/2 - \varepsilon}$

for some $\varepsilon' > \varepsilon > 0$ with $\varepsilon' \to 0$ and $\varepsilon/\varepsilon' \to 0$, to produce the expected
asymptotic formula. For thin sequences, $D(x)$ must be less than $A(x)$ and
then the required range (B1) is somewhat larger but nevertheless we still retain
the asymptotics for primes under reasonable assumptions. We could arrange
our combinatorial arguments somewhat differently to come up with (B1) for
smaller $N$ but it would be applicable only for sequences sufficiently dense.

Note that the restriction $N < \delta^{-1}\sqrt{x}$ in (B1) makes (B) realistic whereas
requiring (B) for $N = \sqrt{x}$ would not be, for reasons similar to the prohibition
of $D(x) = x$ in (R). Were we to make this seemingly slight extension in the
range of (B), we would be able to give a simpler proof, but of a result with
virtually no application. We overcome the additional difficulty occurring in
the short range $\delta^{-1}\sqrt{x} \le N \le \sqrt{x}$ with the aid of a device used by Bombieri
in [B]. Also, our stipulation of the lower bound restriction $N > \Delta^{-1}\sqrt{D}$ in
(B1) is essential; indeed by narrowing this slightly to $N > \sqrt{D}$ we would not
be able to break the parity problem.

It is worth mentioning that the source of cancellation in the bilinear form
in (B) comes from the sign changes of the Möbius function $\mu(mn)$ in the
inner sum so there is no need to express this sum in terms of the remainder
$r_{mn}(x)$, as in some other works, cf. [I]. There is also the Möbius function
$\mu(d)$ in the coefficient $\gamma(n, C)$. Here $d$ must be quite a bit smaller than $N$ to
ensure that $\mu(d)$ does not completely neutralize $\mu(n)$. By (B1)-(B3) we know
that $d \le C \le x\Delta N D^{-3/2}$, so our hypothesis (B) can be realistic only if $D$ is
somewhat larger than $x^{2/3+\varepsilon}$.

To get an idea of the role of (B) in enabling us to detect primes let us
recall a well-known example of Selberg which draws attention to the failure
of the classical set-up. Let $\mathcal{A} = (a_n)$ with $a_n = \frac12(1 + \lambda(n))$ where $\lambda$ is the
Liouville function so $a_n$ is the characteristic function of the integers having an
even number of prime factors, in particular $a_p = 0$ for every $p$. This sequence
satisfies (R) with $D = x^{1-\varepsilon}$ but fails to satisfy (B). Even if we were to refine
(B) replacing $a_{mn}$ by $r_{mn}(x)$ it would not be satisfied by $\mathcal{A}$. Indeed we have

$2r_d(x) = \lambda(d) \sum_{m \le x/d} \lambda(m) + O(1)$.

Thus $\mu(d) r_d(x)$ has constant sign in long intervals, namely for all $d$ satisfying
$\frac{x}{L+1} < d \le \frac{x}{L}$ with any integer $L < \sqrt{x}$.
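Selberg's example is easy to reproduce on a computer. The sketch below (Python, with illustrative function names) builds $a_n = \frac12(1 + \lambda(n))$ from the Liouville function and shows that the sequence has density about one half while vanishing on every prime, so that no sieve relying only on the distribution of $\mathcal{A}$ in residue classes can detect primes in it.

```python
def big_omega(n):
    """Number of prime factors of n counted with multiplicity."""
    count, p = 0, 2
    while p * p <= n:
        while n % p == 0:
            n //= p
            count += 1
        p += 1
    return count + (1 if n > 1 else 0)

def liouville(n):
    """Liouville function lambda(n) = (-1)^Omega(n)."""
    return -1 if big_omega(n) % 2 else 1

def a(n):
    """Selberg's sequence: 1 if n has an even number of prime factors, else 0."""
    return (1 + liouville(n)) // 2

def A(x):
    """Counting function A(x) = sum of a(n) over n <= x."""
    return sum(a(n) for n in range(1, x + 1))

x = 10**4
primes = [n for n in range(2, x + 1) if big_omega(n) == 1]
print("A(x)/x =", A(x) / x)                          # close to 1/2
print("sum of a_p over primes p <= x:", sum(a(p) for p in primes))
```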
Of course such a sequence is rather artificial and we certainly expect that
the sieve given here is quite capable of settling, for instance, the twin prime
problem. The stumbling block is that in this case we have no idea how to prove
that the relevant sequence satisfies the condition (B). Even proving (R) to such
a high level is currently beyond reach. Fortunately there are other interesting
sequences for which these hypotheses can be verified.

There are certainly other options for the introduction of bilinear form
estimates to obtain primes with the sieve, see for example [DFI], but it is not
yet clear what is the optimal general strategy if indeed there is one. Our choice
was motivated by the desire to treat thin sequences, in particular $a_n$ being the
number of representations $n = a^2 + b^4$. In this case (R) was established in
[FI] with $D(x) = x^{3/4-\varepsilon}$. The bilinear form bound (B) valid in the range
$x^{1/4+\varepsilon} < N < x^{1/2}(\log x)^{-A}$ is given in [FI2] and this more than suffices to
prove the asymptotic formula

$\sum_{a^2+b^4 \le x} \Lambda(a^2+b^4) \sim 4\pi^{-1}\kappa x^{3/4}$,

where $a$ and $b$ run over positive integers and $\kappa = \Gamma(\frac14)^2/6\sqrt{2\pi}$. In Section 10,
with such applications in mind, we give a modified theorem which incorporates
a technical condition to make the axiom (B) easier to verify.

Acknowledgment. We are happy to thank the Institute for Advanced
Study for pleasant working conditions and Enrico Bombieri for helpful
conversations. We thank Andrew Granville, Mark Watkins, and the referee for
pointing out inaccuracies in the previous draft and making suggestions which
led us to improvements. J.F. also wishes to express thanks to Macquarie
University for their hospitality during part of this work.

2. Technical reductions 

What we actually need for the proof of Theorem 1 are the following
hypotheses:

(R') $\sum_{d \le D} \mu^2(d)\tau_5(d)\,|r_d(t)| \ll A(x)(\log x)^{-3}$

(B') $\sum_{m} \tau_5(m) \Bigl|\sum_{N<n\le 2N,\ mn\le x} \gamma(n)\mu(mn)a_{mn}\Bigr| \ll A(x)(\log x)^{-3}$

in the same notation and ranges as in (R) and (B). Here the implied constant
depends only on $g$.
In this section we derive (R') and (B') from (R) and (B), respectively, using
the crude bound (1.6). As soon as (R'), (B') are established the hypotheses
(R), (B) and (1.6) are no longer required.

We begin with a lemma which is reminiscent of a result due to Wolke [W].

Lemma 1. Fix $k \ge 2$. Any squarefree integer $n \ge 1$ has a divisor $d \le n^{1/k}$
such that

$\tau(n) \le (2\tau(d))^k$.

Proof. Write $n = p_1 \cdots p_r$ with $p_1 < \cdots < p_r$ and put $d = p_1 \cdots p_{[r/k]}$;
thus $d \le n^{1/k}$ and

$2\tau(d) = 2^{1+[r/k]} > 2^{r/k} = \tau(n)^{1/k}$

completing the proof. □
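The construction in the proof of Lemma 1 (take $d$ to be the product of the $[r/k]$ smallest prime factors of the squarefree number $n$) can be tested directly. The Python sketch below uses illustrative helper names and checks both conclusions, $d^k \le n$ and $\tau(n) \le (2\tau(d))^k$, for all squarefree $n$ below a given limit.

```python
def distinct_primes(n):
    """Prime factors of n in increasing order, or None if n is not squarefree."""
    primes, p = [], 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return None  # p^2 divides n
            primes.append(p)
        p += 1
    if n > 1:
        primes.append(n)
    return primes

def lemma1_divisor(primes, k):
    """Product of the [r/k] smallest primes, as in the proof of Lemma 1."""
    d = 1
    for p in primes[: len(primes) // k]:
        d *= p
    return d

def check_lemma1(limit, k):
    """True if d^k <= n and tau(n) <= (2 tau(d))^k for all squarefree n < limit."""
    for n in range(1, limit):
        primes = distinct_primes(n)
        if primes is None:
            continue
        d = lemma1_divisor(primes, k)
        tau_n = 2 ** len(primes)          # tau of a squarefree number
        tau_d = 2 ** (len(primes) // k)
        if d ** k > n or tau_n > (2 * tau_d) ** k:
            return False
    return True

print([check_lemma1(5000, k) for k in (2, 3, 4)])
```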
We use Lemma 1 for k = 3. First we estimate the following sums: 
^V 5 (cf)iA d (x) < ^T W (d)A d {x) = ^rn(n)a n 

d^x d^x n^x 

<J>(n) 4 a n < £ (2r(d)) 12 A d (x) 

by the lemma. Then by (1.6) this is bounded by $O(A(x)V(x))$ where

$V(x) = \sum^{\flat}_{d \le x} d^{-1}\tau(d)^{20} \ll (\log x)^{2^{20}}$.

Here, and hereafter, $\sum^{\flat}$ indicates that the summation is restricted to squarefree
integers. Therefore

(2.1) J> 5 (d)U d (x) « A(x)(logxf° . 

d^x 

Similarly, but using (1.9), 

(2.2) ^ b r 5 (d)l 5 (d) < [J(l + ll 5 (p)) « (log*) 11 . 

d^x p^x 

Combining (2.1) and (2.2) we infer that 

(2.3) T 5 (d)i\r d (t)\ « A(x)(logxf° . 

d^x 

Finally (R') follows from (R) and (2.3) by Hölder's inequality.
The derivation of (B') from (B) is similar. 
We shall also need, and conclude this section by proving, the following
consequence of (1.9):

(2.4) $\sum_{d \le y,\ (d,\nu)=1} \mu(d)g(d) \ll \sigma_\nu (\log y)^{-6}$

uniformly in $\nu \ge 1$, $y \ge 2$, where

(2.5) a v = J] (l + ^) • 

p\u 

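Let $2 \le z \le \sqrt{y}$ and $P(z)$ be the product of the primes $p < z$. We split
the sum (2.4), say $G_\nu(y)$, as follows:

$G_\nu(y) = \sum\sum_{mn \le y,\ (mn,\nu)=1,\ m \mid P(z),\ (n,P(z))=1} \mu(m)\mu(n)g(m)g(n)$

$= \sum_{m \le \sqrt{y},\ m \mid P(z),\ (m,\nu)=1} \mu(m)g(m) \sum_{n \le y/m,\ (n,\nu P(z))=1} \mu(n)g(n)$

$+ \sum_{n \le \sqrt{y},\ (n,\nu P(z))=1} \mu(n)g(n) \sum_{\sqrt{y} < m \le y/n,\ m \mid P(z),\ (m,\nu)=1} \mu(m)g(m)$.

Estimating the outer sums by

$\le \prod_{p \le y} (1 + g(p)) \ll \log y$

we get

$G_\nu(y) \ll (|G_\nu(w,z)| + H(y,z)) \log y$

for some $w$ with $\sqrt{y} \le w \le y$, where

$G_\nu(w,z) = \sum_{n \le w,\ (n,\nu P(z))=1} \mu(n)g(n)$

and

$H(y,z) = \sum_{m > \sqrt{y},\ m \mid P(z)} g(m)$.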

We estimate $H(y, z)$ by using Rankin's trick as follows:

$H(y,z) \le y^{-\varepsilon} \sum_{m \mid P(z)} g(m) m^{2\varepsilon} = y^{-\varepsilon} \prod_{p<z} (1 + g(p)p^{2\varepsilon})$

$\le y^{-\varepsilon} \prod_{p<z} (1 + 8g(p)) \ll \exp\Bigl(-\frac{\log y}{\log z}\Bigr) (\log z)^8$

by choosing $\varepsilon = (\log z)^{-1}$. To estimate $G_\nu(w,z)$ we consider
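Rankin's trick, as used for $H(y, z)$, replaces the awkward condition $m > \sqrt{y}$ by the weight $(m^2/y)^{\varepsilon} \ge 1$ and then drops the condition, so that the sum factors into an Euler product. The Python sketch below compares the exact tail sum with the resulting bound $y^{-\varepsilon}\prod_{p<z}(1 + g(p)p^{2\varepsilon})$. The density $g(p) = 1/p$, the level $z = 30$ and the helper names are assumptions made only for this illustration.

```python
from itertools import combinations
from math import log, prod

PRIMES_BELOW_30 = [2, 3, 5, 7, 11, 13, 17, 19, 23, 29]  # the primes p < z = 30

def g(p):
    """Illustrative density function g(p) = 1/p (an assumption of this sketch)."""
    return 1.0 / p

def tail_sum(y, primes):
    """Exact H(y, z): sum of g(m) over m | P(z) with m > sqrt(y)."""
    total = 0.0
    for r in range(len(primes) + 1):
        for combo in combinations(primes, r):
            m = prod(combo)
            if m * m > y:
                total += prod(g(p) for p in combo)
    return total

def rankin_bound(y, primes, eps):
    """Rankin's upper bound y^(-eps) * prod_{p < z} (1 + g(p) p^(2 eps))."""
    return y ** (-eps) * prod(1 + g(p) * p ** (2 * eps) for p in primes)

y = 10**6
eps = 1 / log(30)  # the choice eps = 1 / log z from the text
print("exact tail sum:", tail_sum(y, PRIMES_BELOW_30))
print("Rankin bound:  ", rankin_bound(y, PRIMES_BELOW_30, eps))
```

The bound holds for every $\varepsilon > 0$; the choice $\varepsilon = (\log z)^{-1}$ keeps each factor $p^{2\varepsilon}$ below $e^2 < 8$.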

$F(w,z) = \sum_{n \le w,\ (n,P(z))=1} \mu(n) f(n)$

where $f$ is another multiplicative function which satisfies the hypothesis (1.9).
By (1.9) we deduce that 



E /(p) - E f (p) 



for any u ^ u ^ z with 

E p l + ( io g z ) -10 < ^(iog 

We also have the inequality

$|F(w,z) - G_\nu(w,z)| \le h(z) \sum_{mn \le w} f(m)g(n)$

which can be seen from the identity

$f(p_1 \cdots p_r) - g(p_1 \cdots p_r) = \sum_{j=1}^{r} f(p_1 \cdots p_{j-1})\bigl(f(p_j) - g(p_j)\bigr) g(p_{j+1} \cdots p_r)$.

Therefore

$G_\nu(w,z) = F(w,z) + O\bigl(h(z)(\log w)^2\bigr)$.

It remains to estimate $F(w,z)$. To this end we take the simple function $f(n) =
n^{-1}$ for which we know that

F(w,z) <C z" c + exp l-^L) (\ogzf. 

This can be proven by a standard contour integration using the bound for the 
corresponding zeta function 

E Kn)f(n)n- S = ((s + l)" 1 [J(l - p^T 1 « (log(|a| + l))(log zf 

(n,P(z))=l P<2 

valid for 

f c 1 

Res ^ —mm 



log(|s| + 2) ' logz J ' 
where c is a positive constant. Collecting the above results we arrive at 



G u (y) « ^(logz)- 10 (logy) 3 + exp [~^^ ) ^>gtjf- 
We choose logz = (log y) (log log y)~ 2 obtaining (2.4). 






3. Combinatorial identities 

For $f : \mathbb{N} \to \mathbb{C}$, an arithmetic function, we split

$f(n \le z) = f(n)$ if $n \le z$, and $0$ if $n > z$,

$f(n > z) = 0$ if $n \le z$, and $f(n)$ if $n > z$.

We have

$f(n > z) = \sum_{bc \mid n} \mu(b) f(c > z)$.

Split the Möbius function $\mu(b) = \mu(b \le y) + \mu(b > y)$ getting

$f(n > z) = \sum_{bc \mid n} \mu(b \le y) f(c > z) + \sum_{bc \mid n} \mu(b > y) f(c > z)$.

In the first sum insert $f(c > z) = f(c) - f(c \le z)$ getting the identity (a variant
of that of Vaughan),

$f(n > z) = \sum_{b \mid n} \mu(b \le y) F(n/b) - \sum_{bc \mid n} \mu(b \le y) f(c \le z) + \sum_{bc \mid n} \mu(b > y) f(c > z)$

where $F = f * 1$. Split the last sum, say $f(n; y, z)$, into three subsums

$f_1(n; y, z) = \sum_{bc \mid n} \mu(b > sy) f(c > sz)$,

$f_2(n; y, z) = \sum_{bc \mid n} \mu(sy \ge b > y) f(c > z)$,

$f_3(n; y, z) = \sum_{bc \mid n} \mu(b > sy) f(sz \ge c > z)$.

Here $s$ can be any number $\ge 1$. We choose $s$ to be the power of 2 in the interval

$\frac{\Delta\sqrt{x}}{8\delta\sqrt{D}} < s \le \frac{\Delta\sqrt{x}}{4\delta\sqrt{D}}$.

Denote

$F(n; y) = \sum_{b \mid n} \mu(b \le y) F(n/b)$,

$F(n; y, z) = \sum_{bc \mid n} \mu(b \le y) f(c \le z)$.

We have

(3.1) $f(n > z) = F(n; y) - F(n; y, z) + \sum_{j=1}^{3} f_j(n; y, z)$.
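The Vaughan-type identity $f(n > z) = \sum_{b \mid n} \mu(b \le y) F(n/b) - \sum_{bc \mid n} \mu(b \le y) f(c \le z) + \sum_{bc \mid n} \mu(b > y) f(c > z)$ holds for every $n$ and every choice of $y, z$, and this can be confirmed by brute force. The Python sketch below does so for $f = \Lambda$ and $F = \log$; the sums over $bc \mid n$ run over pairs $(b, c)$ whose product divides $n$, and all function names are illustrative.

```python
from math import log

def mobius(n):
    """Moebius function mu(n), computed by trial division."""
    result, p = 1, 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0
            result = -result
        p += 1
    return -result if n > 1 else result

def von_mangoldt(n):
    """Lambda(n) = log p if n is a power of the prime p, and 0 otherwise."""
    if n < 2:
        return 0.0
    p = 2
    while p * p <= n:
        if n % p == 0:
            while n % p == 0:
                n //= p
            return log(p) if n == 1 else 0.0
        p += 1
    return log(n)

def divisors(n):
    return [d for d in range(1, n + 1) if n % d == 0]

def truncated_lambda(n, z):
    """Left-hand side: Lambda(n > z)."""
    return von_mangoldt(n) if n > z else 0.0

def vaughan_rhs(n, y, z):
    """Right-hand side: the three sums of the identity with f = Lambda, F = log."""
    first = second = third = 0.0
    for b in divisors(n):
        mu = mobius(b)
        if mu == 0:
            continue
        if b <= y:
            first += mu * log(n // b)      # mu(b <= y) F(n/b)
        for c in divisors(n // b):         # pairs (b, c) with bc | n
            lam = von_mangoldt(c)
            if b <= y and c <= z:
                second += mu * lam         # mu(b <= y) f(c <= z)
            elif b > y and c > z:
                third += mu * lam          # mu(b > y) f(c > z)
    return first - second + third

worst = max(abs(truncated_lambda(n, 4) - vaughan_rhs(n, 3, 4)) for n in range(1, 300))
print("largest discrepancy:", worst)
```

The third sum is the one that is split further into $f_1$, $f_2$, $f_3$ according to the sizes of $b$ and $c$.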

We apply the integral operator

$I_h(Y, Z) = \int_Y^{eY} \int_Z^{eZ} h(y, z) (yz)^{-1}\, dy\, dz$

to obtain an identity with smoothed parts. Note that if $h$ is independent of
one or both variables the integration over the relevant variable(s) does nothing.
When we apply this operator to (3.1) we denote the results by changing $y, z$
to $Y, Z$ as appropriate. For example for the truncated function $f(n > z)$ this
operator gives

$f(n > Z) = f(n)$ if $n > eZ$, $\quad f(n)\log\frac{n}{Z}$ if $Z < n \le eZ$, $\quad 0$ if $n \le Z$.
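The effect of the integral operator on a sharp cutoff can be checked numerically: integrating the indicator of $n > z$ against $dz/z$ over $Z < z \le eZ$ gives weight $1$ for $n > eZ$, weight $\log(n/Z)$ for $Z < n \le eZ$, and weight $0$ for $n \le Z$. The Python sketch below compares this closed form with a midpoint-rule approximation of the integral; the function names and the number of quadrature steps are illustrative assumptions.

```python
from math import e, log

def smoothed_weight(n, Z):
    """Closed form of int_Z^{eZ} 1[n > z] dz/z."""
    if n > e * Z:
        return 1.0
    if n > Z:
        return log(n / Z)
    return 0.0

def smoothed_weight_numeric(n, Z, steps=20000):
    """Midpoint rule for the same integral in the variable u = log z."""
    u0, h = log(Z), 1.0 / steps      # u runs over [log Z, log Z + 1]
    total = 0.0
    for i in range(steps):
        u = u0 + (i + 0.5) * h
        if log(n) > u:               # equivalent to n > z
            total += h
    return total

for n in (5, 12, 20, 40):
    print(n, smoothed_weight(n, 10.0), smoothed_weight_numeric(n, 10.0))
```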

We introduce this integral operator over the variables y, z in order to facilitate 
their separation in various forthcoming multiple sums. 

We shall apply the above considerations for $f = \Lambda$ in which case $F = L$
with $L(n) = \log n$. Thus we have from (3.1),

(3.2) $\Lambda(n > Z) = L(n; Y) - L(n; Y, Z) + \sum_{j=1}^{3} \Lambda_j(n; Y, Z)$.

We shall take $n \le x$ and make the choices

(3.3) $Y = Z = \Delta^{-1}\sqrt{D}$.

Note that when $y$ changes over the segment $Y < y \le eY$ then $sy$ satisfies the
inequalities $\frac18\delta^{-1}\sqrt{x} < sy < \delta^{-1}\sqrt{x}$. The same holds for the variable $z$.

The obvious fact that, for $n$ squarefree, the left-hand side of (3.2) is
supported on primes is lost when we look instead at the right-hand side (and this
is the intention of the formula). In order to keep partial track of this
information we introduce an upper-bound sieve $\{\lambda_\nu, \nu \le \Delta\}$ of level $\Delta$ and having
$|\lambda_\nu| \le 1$. Such a sieve can be obtained, for example, by restricting $\mu(\nu)$ to
certain numbers $\nu < \Delta$, including $\nu = 1$, having the property

$\rho_n = \sum_{\nu \mid n} \lambda_\nu \ge 0$.



Clearly $\Lambda(n > Z) = \rho_n \Lambda(n > Z)$ for squarefree $n$ since $Z > \Delta$. Therefore,

(3.4) $S(x; Z) = \sum_{n \le x} a_n \Lambda(n > Z) = \sum_{n \le x} a_n \rho_n \Lambda(n > Z)$

$= T(x; Y) - T(x; Y, Z) + \sum_{j=1}^{3} S_j(x; Y, Z)$

where

$T(x; Y) = \sum_{n \le x} a_n \rho_n L(n; Y)$,

$T(x; Y, Z) = \sum_{n \le x} a_n \rho_n L(n; Y, Z)$,

$S_j(x; Y, Z) = \sum_{n \le x} a_n \rho_n \Lambda_j(n; Y, Z), \quad j = 1, 2, 3$.

The sum in (1.2)

$S(x) = \sum_{p \le x} a_p \log p$

is closely approximated by $S(x; Z)$; precisely we have

(3.5) $0 \le S(x) - S(x; Z) \le \sum_{p \le eZ} a_p \log p \ll A(x)(\log x)^{-1}$

by (1.4). It remains to treat the various sums on the right side of (3.4).

4. Evaluation of T(x; Y) 

We are able to treat the sum $T(x; y)$ for individual $x, y$. We split this by
means of $\log\frac{n}{b} = \log n - \log b$, getting

$T(x; y) = T_1(x; y) + T_2(x; y)$.

We have

(4.1) $T_1(x; y) = \sum_{n \le x} a_n \rho_n \log n \sum_{b \mid n,\ b \le y} \mu(b) = \sum_{b \le y} \mu(b) \sum_{n \le x,\ n \equiv 0 \ (\mathrm{mod}\ b)} a_n \rho_n \log n$.

Here the inner sum is, by partial summation,

(4.2) $V_b(x)\log x - \int_1^x V_b(t) \frac{dt}{t}$
where

(4.3) $V_b(x) = \sum_{n \le x,\ n \equiv 0 \ (\mathrm{mod}\ b)} a_n \rho_n = \sum_{\nu} \lambda_\nu A_{[\nu,b]}(x) = \sum_{\nu} \lambda_\nu \{g([\nu,b])A(x) + r_{[\nu,b]}(x)\}$.

We have $g([\nu, b]) = g(\nu)g(b/(\nu,b))$ and

$\sum_{b \le y} \mu(b)\, g\Bigl(\frac{b}{(\nu,b)}\Bigr) = \sum_{d \mid \nu} \mu(d) \sum_{b \le y/d,\ (b,\nu)=1} \mu(b)g(b)$.

This last inner sum is, by (2.4), bounded by $O(\sigma_\nu(\log x)^{-6})$. By this and (4.2)
we deduce that the contribution to (4.1) coming from the main term in (4.3)
is

$\ll A(x)(\log x)^{-5} \sum_{\nu} |\lambda_\nu|\, g(\nu)\tau(\nu)\sigma_\nu \ll A(x)(\log x)^{-3}$.

The remainder terms in (4.3) make a contribution to (4.1) which is

$\ll (\log x) \sum^{\flat}_{d < \Delta y} \tau_3(d)\,|r_d(x)| \ll A(x)(\log x)^{-2}$

by (R') since $\Delta y < D$. Thus we obtain the bound $T_1(x; y) \ll A(x)(\log x)^{-2}$.
We now consider

$T_2(x; y) = -\sum_{b \le y} \mu(b)\log b \sum_{\nu} \lambda_\nu A_{[\nu,b]}(x)$.

We insert the approximation (1.7) for $A_{[\nu,b]}(x)$. The remainder terms give a
contribution which may be estimated in the same way as those for $T_1(x; y)$
giving the same bound $\ll A(x)(\log x)^{-2}$. Here, however, the main term provides
also the main term for the total sum $S(x)$. This is

(4.4) $-A(x) \sum_{\nu} \lambda_\nu g(\nu) \sum_{b \le y} \mu(b)\, g\Bigl(\frac{b}{(\nu,b)}\Bigr) \log b$.

We extend the summation to all $b$ making an error $\ll A(x)(\log x)^{-3}$, by (2.4).
Then, we evaluate the complete inner sum as

$\sum_{\eta \mid \nu} \mu(\eta) \sum_{(b,\nu)=1} \mu(b)g(b)\log \eta b = \sum_{\eta \mid \nu} \mu(\eta) \sum_{(b,\nu)=1} \mu(b)g(b)\log b$

and this is equal to $-H$ if $\nu = 1$ and equal to zero otherwise. Thus the sum
(4.4), and also $T_2(x; y)$, are given by $HA(x) + O(A(x)(\log x)^{-2})$.
Combining this with the bound for $T_1(x; y)$ we obtain

(4.5) $T(x; y) = HA(x) + O(A(x)(\log x)^{-2})$.



5. Estimation of T(x; Y, Z) 

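It again suffices to estimate the contribution for given $y, z$ and here this
is given by the sum

$T(x; y, z) = \sum_{b \le y} \mu(b) \sum_{c \le z} \Lambda(c) \sum_{\nu} \lambda_\nu A_{[\nu, bc]}(x)$.

We insert the approximation (1.7) for $A_{[\nu,bc]}(x)$. For the contribution coming
from the main terms, arguing as with $T_1(x; y)$, we are now led to estimate the
sum

$\sum^{\flat}_{b \le y} \mu(b) \sum^{\flat}_{c \le z,\ c \nmid b} \Lambda(c)\, g\Bigl(\frac{bc}{(\nu,bc)}\Bigr) = \sum^{\flat}_{b \le y} \mu(b)\, g\Bigl(\frac{b}{(\nu,b)}\Bigr) \sum^{\flat}_{c \le z,\ c \nmid b} \Lambda(c)\, g\Bigl(\frac{c}{(\nu,c)}\Bigr)$

and by (2.4), (1.8) this is $\ll \tau(\nu)\sigma_\nu(\log y)^{-6}(\log yz)$. Summing over $\nu$, as with
$T_1(x; y)$, we get a contribution $\ll A(x)(\log x)^{-3}$.

The contribution coming from the remainder terms is bounded by

$\sum_{\nu} |\lambda_\nu| \sum^{\flat}_{b \le y} \sum^{\flat}_{c \le z,\ c \nmid b} \Lambda(c)\,|r_{[\nu,bc]}(x)| \le (\log yz) \sum_{\nu} \sum^{\flat}_{d \le yz} |r_{[\nu,d]}(x)| \ll A(x)(\log x)^{-2}$

by (R') since $yz < e^2\Delta^{-2}D < \Delta^{-1}D$. From the two estimates we conclude
that

(5.1) $T(x; y, z) \ll A(x)(\log x)^{-2}$.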



6. Estimation of S_1(x; Y, Z)

As before this may be estimated for individual $y$ and $z$. We have

$S_1(x; y, z) = \sum\sum\sum_{bcd \le x,\ b > sy,\ c > sz} \mu(b)\Lambda(c)\rho_{bcd}\, a_{bcd}$.

Note that the conditions of summation imply that $c$ is located in a short interval
(in the logarithmic scale), namely

(6.1) $\delta^{-1}\sqrt{x} \ll c \ll \delta\sqrt{x}$,

and $d$ is relatively small, namely

(6.2) $d \ll \delta^2$.

Therefore there is enough room for $b$ out of which to create a nice variable. By
dropping $\mu(b)$ we deduce that

$|S_1(x; y, z)| \le \sum_{c} \Lambda(c) \sum_{d} \sum_{n \le x,\ cd \mid n} \rho_n a_n = \sum_{c} \Lambda(c) \sum_{d} \sum_{\nu} \lambda_\nu A_{[cd,\nu]}(x)$.

Of course, we have lost the possibility of detecting cancellation due to the
sign changes of $\mu(b)$, but we have gained the congruence sums which can be
evaluated asymptotically due to (R'). Note that $[cd, \nu] \le cd\nu \ll \delta^3\Delta\sqrt{x} < D$.

We insert (1.7) and estimate the resulting remainder by (R') getting

(6.3) $|S_1(x; y, z)| \le A(x)L(x)M(x) + O(A(x)(\log x)^{-2})$

where $L(x)$, $M(x)$ come from the main terms, namely

$L(x) = \sum^{\flat}_{c} \Lambda(c)g(c)$,

$M(x) = \sum^{\flat}_{d} \sum_{\nu} \lambda_\nu g([d,\nu])$.

Here $c, d$ run over the segments (6.1) and (6.2) respectively. By (1.8) we infer
that

(6.4) $L(x) \ll \log\delta$.

For the estimation of $M(x)$ we write $g([d,\nu]) = g(d)g(\nu/(\nu, d))$ and note
that

$\sum_{\nu} \lambda_\nu\, g\Bigl(\frac{\nu}{(\nu,d)}\Bigr) \ge 0$.

The latter is a property of any upper-bound sieve, and it holds for any
multiplicative function $f(\nu)$ with $0 \le f \le 1$ in place of $g(\nu/(\nu,d))$. This positivity
property allows us to introduce the factor $(\delta^2/d)^{\varepsilon}$ into $M(x)$ and extend the
summation to all $d$ (Rankin's trick) getting

$M(x) \ll \delta^{2\varepsilon} \sum_{\nu} \lambda_\nu g(\nu) \sum^{\flat}_{d} g\Bigl(\frac{d}{(\nu,d)}\Bigr) d^{-\varepsilon} = \delta^{2\varepsilon} \prod_{p} (1 + g(p)p^{-\varepsilon}) \sum_{\nu} \lambda_\nu g(\nu)h(\nu)$

where $h(\nu)$ is the multiplicative function with

$h(p) = (1 + p^{-\varepsilon})(1 + g(p)p^{-\varepsilon})^{-1} < 2(1 + g(p))^{-1}$.
Note that $g(p)h(p) < 1$, so the sieve theory applies giving

$\sum_{\nu} \lambda_\nu g(\nu)h(\nu) \ll \prod_{p<\Delta} (1 - g(p)h(p)) = \prod_{p<\Delta} (1 - g(p))(1 + g(p)p^{-\varepsilon})^{-1}$.

Hence

$M(x) \ll \delta^{2\varepsilon} \prod_{p \ge \Delta} (1 + g(p)p^{-\varepsilon}) \prod_{p<\Delta} (1 - g(p))$.

We choose $\varepsilon = (\log\Delta)^{-1}$ getting by (1.9),

(6.5) $M(x) \ll \prod_{p<\Delta} (1 - g(p)) \ll (\log\Delta)^{-1}$.

Combining (6.3), (6.4) and (6.5) we conclude that

(6.6) $S_1(x; y, z) \ll A(x)\frac{\log\delta}{\log\Delta}$.

We remark that it is for this bound alone that we introduced the $\lambda_\nu$.
Without doing so we would lose a logarithm, and this very tight bound cannot
afford such a loss. The optimal choice of $\lambda_\nu$ belongs to the theory of the
2-dimensional sieve.



7. Estimation of S_2(x; Y, Z)

In this section and the next we finally encounter the sums where (due to
contamination of the variables coming from the support of $\rho$) it is necessary
to perform the integrated estimate. Whenever $a_n$ occurs we assume $n \le x$
without writing this condition every time. We have

$S_2(x; y, z) = \sum_{e} \sum_{y < b \le sy} \sum_{c > z} \mu(b)\Lambda(c)\rho_{ebc}\, a_{ebc}$,

$|S_2(x; y, z)| \le (\log x) \sum_{k} \Bigl|\sum_{y < b \le sy} \mu(b)\rho_{bk}\, a_{bk}\Bigr|$

$\le (\log x) \sum_{\nu_1} \sum_{\nu_2} |\lambda_{\nu_1\nu_2}| \sum_{k} \Bigl|\sum_{y/\nu_2 < b \le sy/\nu_2} \mu(b)\, a_{\nu_1\nu_2 bk}\Bigr|$.

In order to remove $\nu_2$ from the range of the inner summation we take advantage
of the integration over $y$ (still not integrating over $z$).

(7.1) $S_2(x; Y, z) \le (\log x) \sum_{\nu_1} \sum_{\nu_2} |\lambda_{\nu_1\nu_2}| \sum_{k} \int_{Y/\nu_2}^{eY/\nu_2} \Bigl|\sum_{y < b \le sy} \mu(b)\, a_{\nu_1\nu_2 bk}\Bigr| \frac{dy}{y}$

$\ll (\log x) \int_{Y/\Delta}^{eY} \sum_{\ell} \tau_3(\ell) \Bigl|\sum_{y < b \le sy} \mu(b)\, a_{b\ell}\Bigr| \frac{dy}{y}$.

By (B') we conclude that

(7.2) $S_2(x; Y, z) \ll A(x)(\log x)^{-1}$.

8. Estimation of S_3(x; Y, Z)

Recall that

$S_3(x; y, z) = \sum_{e} \sum_{b > sy} \sum_{z < c \le sz} \mu(b)\Lambda(c)\rho_{eb}\, a_{ebc}$.

Here we have, for the first time, taken advantage of the fact that $\rho_{ebc} = \rho_{eb}$ since
$c$ is prime. In essence this sum is of the same type as that in the previous section
but with $b$ and $c$ interchanged. Thus by analogy to the previous argument,
we put $c$ into the inner summation because its range is appropriate for (B').
However, the fact that $c$ is weighted by $\Lambda$ rather than $\mu$ necessitates splitting
it further in order to obtain a factor with sign changes from which we may
hope to detect cancellation. We write

$\mu(c)\Lambda(c) = \sum_{m \mid c} \mu(m)\log\frac{m}{C} = \Lambda^+(c) - \Lambda^-(c)$

(recall $c$ is squarefree) with $C = xD^{-1}$ where

$\Lambda^+(c) = \sum_{m \mid c} \mu(m)\log^+\frac{m}{C}$,

$\Lambda^-(c) = \sum_{m \mid c} \mu(m)\log^+\frac{C}{m}$.

Note that the definition of $\Lambda^-(c)$ is close to $\gamma(c, C)$ in (B2); precisely we
have

$\Lambda^-(c) = \int_1^C \gamma(c, t)\, t^{-1}\, dt$.
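The splitting of $\mu(c)\Lambda(c)$ into $\Lambda^+(c) - \Lambda^-(c)$, with $\log^+ t = \max(\log t, 0)$, and the integral representation $\Lambda^-(c) = \int_1^C \gamma(c,t)\,t^{-1}\,dt$ can both be verified numerically for squarefree $c > 1$. The Python sketch below uses illustrative function names; the integral is evaluated exactly, since $\gamma(c, t)$ is a step function of $t$ that only jumps at the divisors of $c$.

```python
from math import log

def mobius(n):
    """Moebius function mu(n), computed by trial division."""
    result, p = 1, 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0
            result = -result
        p += 1
    return -result if n > 1 else result

def divisors(n):
    return [d for d in range(1, n + 1) if n % d == 0]

def log_plus(t):
    return max(log(t), 0.0)

def lambda_plus(c, C):
    return sum(mobius(m) * log_plus(m / C) for m in divisors(c))

def lambda_minus(c, C):
    return sum(mobius(m) * log_plus(C / m) for m in divisors(c))

def gamma(c, t):
    """gamma(c, t): sum of mu(d) over divisors d of c with d <= t."""
    return sum(mobius(d) for d in divisors(c) if d <= t)

def lambda_minus_integral(c, C):
    """Exact value of int_1^C gamma(c, t) dt/t, piece by piece between divisors."""
    cuts = [d for d in divisors(c) if d < C] + [C]
    return sum(gamma(c, lo) * log(hi / lo) for lo, hi in zip(cuts, cuts[1:]))

def mu_times_lambda(c):
    """mu(c) Lambda(c) for squarefree c: -log c if c is prime, else 0."""
    is_prime = c > 1 and all(c % p for p in range(2, int(c ** 0.5) + 1))
    return -log(c) if is_prime else 0.0

for c in (7, 15, 30):
    print(c, mu_times_lambda(c), lambda_plus(c, 4.5) - lambda_minus(c, 4.5),
          lambda_minus(c, 4.5), lambda_minus_integral(c, 4.5))
```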

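This gives a contribution to $S_3(x; y, z)$ which, by virtue of (B') applied $\log_2 s$
times, satisfies

(8.1) $|S^-(x; y, z)| \le \sum_{\ell} \tau_4(\ell) \Bigl|\sum_{z < c \le sz} \mu(c)\Lambda^-(c)\, a_{c\ell}\Bigr| \ll A(x)(\log x)^{-1}$.

The contribution to $S_3(x; y, z)$ from $\Lambda^+(c)$ is arranged as follows

$S^+(x; y, z) = \sum_{e} \sum_{b > sy} \mu(b)\rho_{eb} \sum\sum_{z < \ell m \le sz} \mu(\ell) \Bigl(\log^+\frac{m}{C}\Bigr) a_{eb\ell m}$

$= \int_C^x \Bigl(\sum_{e} \sum_{b > sy,\ ebz < x} \mu(b)\rho_{eb} \sum_{\ell < sz,\ eb\ell t < x} \mu(\ell) \sum_{m > t,\ z < \ell m \le sz} a_{eb\ell m}\Bigr) \frac{dt}{t}$.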
Put $d = eb\ell$, so $d < D$ and $d$ is squarefree because of (1.16). Here the inner
sum is equal to $A_d(\min\{x, ebsz\}) - A_d(\max\{dt, ebz\})$. Applying (1.7) this inner
sum becomes

(8.2) $g(d) \sum_{dt < n \le x,\ ebz < n \le ebsz} a_n + r_d(\min\{x, ebsz\}) - r_d(\max\{dt, ebz\})$.

We first treat the contribution to $S^+(x; y, z)$ coming from the main term
in (8.2). Recall that $d = eb\ell$. We get a significant cancellation from summation
of $\mu(b)$. Note that $b$ ranges over the interval $b_{\min} < b < b_{\max}$, where $b_{\min} =
\max\{sy, n/esz\}$ and $b_{\max} = \min\{n/ez, n/e\ell t\}$. This yields

$\sum_{b_{\min} < b < b_{\max},\ (b, e\ell)=1} \mu(b)\rho_{eb}\, g(eb\ell) = g(e\ell) \sum_{\nu_1} \mu(\nu_1)g(\nu_1) \sum_{\nu_2 \mid e} \lambda_{\nu_1\nu_2} \sum_{(b, e\ell\nu_1)=1,\ b_{\min} < b\nu_1 < b_{\max}} \mu(b)g(b)$

and here, by (2.4), the inner sum is $O(\sigma_{e\ell\nu_1}(\log x)^{-6})$. Summing over all the
other variables trivially we find that the main term in (8.2) contributes to
$S^+(x; y, z)$ an amount $S^*(x; y, z)$ which satisfies

(8.3) $S^*(x; y, z) \ll A(x)(\log x)^{-1}$.

The remainder terms in (8.2) contribute an amount $S'(x; y, z)$ satisfying

$|S'(x; y, z)| \le \int_C^x \sum^{\flat}_{dt < x} \sum_{k \mid d,\ kz < x} \tau_4(k)\, \bigl|r_d(\min\{x, ksz\}) - r_d(\max\{dt, kz\})\bigr| \frac{dt}{t}$.

Change the variable of integration $t$ into $t/d$, integrate in $z$ over $Z < z < eZ$
and change $z$ into $z/k$ getting

$|S'(x; y, Z)| \le \int_Z^x \int_C^x \sum^{\flat}_{d \le D} \tau_5(d)\, \bigl|r_d(\min\{x, sz\}) - r_d(\max\{t, z\})\bigr| \frac{dt}{t}\frac{dz}{z}$.

Now we can apply (R') obtaining

(8.4) $S'(x; y, Z) \ll A(x)(\log x)^{-1}$.

Collecting (8.1), (8.3) and (8.4) we conclude that

(8.5) $S_3(x; y, Z) \ll A(x)(\log x)^{-1}$.

Combining (3.4), (3.5), (4.5), (5.1), (6.6), (7.2) and (8.5) we complete the
proof of Theorem 1.

9. A generalization 

Our condition that $a_n$ are supported on squarefree numbers may not be
convenient in some applications. Therefore, in this section we establish a result
which does not require this condition. Throughout we assume that $\mathcal{A} = (a_n)$
satisfies all the previous hypotheses except possibly for (1.16). We replace
(1.16) by the following condition:
For all primes $p$ we have

(9.1) $0 \le g(p^2) \le g(p), \quad g(p^2) \ll p^{-2}$.

We may remark that, although this condition was not needed before, it would
have been natural to include it in the form $g(p^2) = 0$ because $A_{p^2}(x) = 0$ due
to (1.16).

Now we need to have some control on the size of the coefficients $a_n$. We
assume that

(9.2) $\sum_{n \le x} a_n^2 \le x^{-2/3} A(x)^2$.

Remarks. The condition (9.2) holds for sequences $\mathcal{A} = (a_n)$ satisfying
$a_n \le n^{\varepsilon}$ and $A(x) \ge x^{2/3+2\varepsilon}$. Actually our argument requires a slightly weaker
condition, for example the bound

(9.3) $\sum_{n \le x} a_n^2 \le x^{-1} A(x)^2 D(x)^{1/2}$

would be sufficient.

We furthermore need a slightly stronger version of (R), namely for any
$t \le x$ we assume

(R3) $\sum^{3}_{d < DL^2} |r_d(t)| \le A(x)L^{-2}$

where $L = (\log x)^{2^{24}}$ and $\sum^{3}$ restricts to cubefree moduli.

Theorem 2. Assume the following hypotheses: (1.4), (1.6), (1.8), (1.9),
(B), (B1), (B2), (B3), (9.1), (9.2), (R3) and (R1). Then (1.17) holds true for
$\mathcal{A} = (a_n)$.

By analogy with the squarefree case, before proving Theorem 2 we show
that (R3) implies

(R'3) $\sum^{3}_{d < DL^2} \tau(d)\,|r_d(t)| \ll A(x)L^{-1}(\log x)^{2^{17}}$.

d<DL 2 

To this end we establish the following extension of Lemma 1.

Lemma 2. Fix $k \ge 2$. Any $n \ge 1$ has a divisor $d \le n^{1/k}$ such that

$\tau(n) \le (2\tau(d))^{k\log k/\log 2}$.

Proof. Write $n = \ell m$ where $\ell$ collects those primes occurring with exponent
$\ge k$ and $m$ is made up of those having exponent $< k$. Thus $(\ell, m) = 1$. Put

$d_\ell = \prod_{p^a \| \ell} p^{[a/k]}$

so that $d_\ell \le \ell^{1/k}$ and

$\tau(d_\ell)^{k\log k/\log 2} \ge \tau(\ell)$.

Write $m = p_1^{a_1} \cdots p_r^{a_r}$ with $p_1 < \cdots < p_r$. Setting $d_m = p_1 \cdots p_{[r/k]}$ we have
$d_m \le m^{1/k}$ and

$(2\tau(d_m))^k \ge 2^r \ge \prod_{j} (a_j + 1)^{\log 2/\log k} = \tau(m)^{\log 2/\log k}$.

Take $d = d_\ell d_m$ completing the proof. □
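The divisor built in the proof of Lemma 2 can be tested numerically. The Python sketch below assumes that the exponent in the lemma is $k\log k/\log 2$, which is what the inequality $2^r \ge \prod_j (a_j+1)^{\log 2/\log k}$ in the proof suggests; this reading, and all function names, are assumptions of the sketch. It forms $d = d_\ell d_m$ and checks $d^k \le n$ and $\tau(n) \le (2\tau(d))^{k\log k/\log 2}$.

```python
from math import log

def factorize(n):
    """Prime factorization of n as a dict {prime: exponent}."""
    factors, p = {}, 2
    while p * p <= n:
        while n % p == 0:
            factors[p] = factors.get(p, 0) + 1
            n //= p
        p += 1
    if n > 1:
        factors[n] = factors.get(n, 0) + 1
    return factors

def tau(n):
    """Number of divisors of n."""
    result = 1
    for a in factors_values(n):
        result *= a + 1
    return result

def factors_values(n):
    return factorize(n).values()

def lemma2_divisor(n, k):
    """d = d_l * d_m as in the proof: p^[a/k] for exponents a >= k, and the
    [r/k] smallest of the r primes whose exponent is below k."""
    factors = factorize(n)
    d = 1
    for p, a in factors.items():
        if a >= k:
            d *= p ** (a // k)
    small = sorted(p for p, a in factors.items() if a < k)
    for p in small[: len(small) // k]:
        d *= p
    return d

def check_lemma2(limit, k):
    exponent = k * log(k) / log(2)
    for n in range(1, limit):
        d = lemma2_divisor(n, k)
        if n % d != 0 or d ** k > n:
            return False
        if tau(n) > (2 * tau(d)) ** exponent * (1 + 1e-12):
            return False
    return True

print([check_lemma2(5000, k) for k in (2, 3, 4)])
```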

We proceed to the proof of (R' 3 ). By Lemma 2 with k = 3, the left-hand 
side is bounded by 

27^%(d)> d (t)| ^ 27(^%(d) 10 Mt)|y Mt)|) 2 

d^x% d^x3 d^x3 

by Cauchy's inequality. In the first sum we replace $|r_d(t)|$ by $A_d(x) + g(d)A(x)$
and estimate as follows. By (1.6)

J2 3 r(d) 10 A d (x)^A(x) ^ 1 r(d) 18 «^(x)(logx) 218 , 

i i 

d^a;3 d^x^ 

while, by (9.1) and (1.9), 

]T 3 T(d)™g(d) < J] f 1 + 2 10 *(p) + 3 10 <7(/)) « Oog *f° . 

For the second sum we apply (R3). These give (R 3 ). 

Now we are ready to prove Theorem 2. We shall apply Theorem 1 for the
sequence $\tilde{\mathcal{A}} = (\tilde a_n)$ with

(9.4) $\tilde a_n = \mu^2(n) a_n$.

Let $\tilde g(d)$ be the multiplicative function given by

(9.5) $\tilde g(p) = \frac{g(p) - g(p^2)}{1 - g(p^2)}$.

Notice that $\tilde g(p)$ satisfies (1.8) and (1.9) due to (9.1). For any $d$ squarefree we
write

(9.6) $\tilde A_d(x) = \sum_{n \le x,\ n \equiv 0 \ (\mathrm{mod}\ d)} \tilde a_n = \tilde g(d)\tilde A(x) + \tilde r_d(x)$

where $\tilde A(x) = \tilde A_1(x)$.

Although (B) for $\tilde{\mathcal{A}}$ and $\mathcal{A}$ coincide, it takes some effort to verify (R) for
$\tilde r_d$, that is

(9.7) $\tilde R(t, D) = \sum^{\flat}_{d \le D} |\tilde r_d(t)| \le \tilde A(x)(\log x)^{-2^{22}}$.

We shall derive this from (R3). Detecting squarefree numbers by the Möbius
formula $\mu^2(n) = \sum_{\nu^2 \mid n} \mu(\nu)$ we get, for $d$ squarefree,

$\tilde A_d(x) = \sum_{\nu} \mu(\nu) A_{[\nu^2,d]}(x) = \tilde g(d) G A(x) + \sum_{\nu} \mu(\nu)\, r_{[\nu^2,d]}(x)$

where

(9.8) $G = \prod_{p} (1 - g(p^2))$,

so that

(9.9) $\tilde g(d) G = \sum_{\nu} \mu(\nu)\, g([\nu^2, d])$.
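The Möbius formula $\mu^2(n) = \sum_{\nu^2 \mid n} \mu(\nu)$, which is used here to detect squarefree numbers, can be confirmed by a direct computation. A minimal Python sketch with illustrative function names:

```python
def mobius(n):
    """Moebius function mu(n), computed by trial division."""
    result, p = 1, 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0
            result = -result
        p += 1
    return -result if n > 1 else result

def squarefree_indicator(n):
    """Sum of mu(v) over all v with v^2 dividing n."""
    total, v = 0, 1
    while v * v <= n:
        if n % (v * v) == 0:
            total += mobius(v)
        v += 1
    return total

# The two columns agree: 1 for squarefree n, 0 otherwise.
for n in (1, 4, 6, 12, 30, 36, 49, 210):
    print(n, squarefree_indicator(n), mobius(n) ** 2)
```

Inserting this formula into $\tilde A_d(x) = \sum \mu^2(n) a_n$ and collecting the terms with $[\nu^2, d] \mid n$ gives the expansion of $\tilde A_d(x)$ in terms of $A_{[\nu^2,d]}(x)$.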

In particular,

(9.10) $\tilde A(x) = GA(x) + \sum_{\nu} \mu(\nu)\, r_{\nu^2}(x)$.

Hence

$\tilde r_d(x) = \sum_{\nu} \mu(\nu) \bigl(r_{[\nu^2,d]}(x) - \tilde g(d)\, r_{\nu^2}(x)\bigr)$.

Therefore we have $\tilde R(t, D) \ll E\log x$, where

$E = \sum_{\nu} \sum^{\flat}_{d \le D} |r_{[\nu^2,d]}(t)|$.

Note that every $k$ has at most $\tau(k)$ representations as $k = [\nu^2, d]$ and $k$ is
cubefree if it has any. By (R'3) it follows that the error terms $r_{[\nu^2,d]}(t)$ with
$\nu \le L$ and $d \le D$ contribute

(9.11) $E_1 \ll A(x)L^{-1}(\log x)^{2^{17}}$.
From the remaining terms we get

$E_2 \le \sum_{\nu > L} \sum^{\flat}_{d \le x} \bigl(A_{[\nu^2,d]}(x) + g([\nu^2,d])A(x)\bigr) = E_{21} + E_{22} + O\bigl(A(x)(\log x)L^{-1}(\log L)^B\bigr)$

where $E_{21} + E_{22}$ is the splitting of the first term according to $\nu > L\sqrt{D}$ or
$L < \nu \le L\sqrt{D}$ and the second term is estimated by using (1.8), (1.9), and
(9.1). Here $B$ depends only on $g$. We estimate as follows:

E 21 ^-nT{n) £ Kfe^yfe^) 3 E ^ 

v>L\f~D v>LsJ~D 

<C x~^A(x)Z) _ 3L^^a;5(logx) 17 ^ A(x)L - ^ (log x) 17 
by (9.2) and (Rl). For the second sum we apply Cauchy's inequality getting 

e 2 22 < ( a Ax)) (E a ™ r ( n ) 3 ) = s ^ > 



say. Here 



5i = (9{v 2 )A(x) + r v * (*)) « ^(x)L- 1 (log L) J 



by (1.8), (9.1), and (R3). To estimate S 2 we apply Lemma 2 with k = 3 getting 
by (1.6) 

<S 2 = £a„r(n) 3 s? £ ^,(x)(2r(d))W 



< A(rc) £ d -1 r(d)T^ +8 < A(x)(log 



x) 3J + 8 



d<x 



Gathering the above estimates we conclude the proof of (9.7), but with
$A(x)$ in place of $\tilde A(x)$ (actually we saved at least a factor $\log x$ which is needed
for clearing the implied constants below). Note, however, that along the above
lines we have, by (9.10), also proved

(9.12) $\tilde A(x) = GA(x)\{1 + O((\log x)^{-1})\}$.

This formula gives (9.7) for the sequence $\tilde{\mathcal{A}}$ because $x$ is large and it gives
(1.6) because

$\tilde A_d(x) \le A_d(x) \ll d^{-1}\tau(d)^8 A(x) \ll d^{-1}\tau(d)^8 \tilde A(x)$.

This completes the proof that $\tilde{\mathcal{A}}$ satisfies all the hypotheses of Theorem 1.
The constant (1.14) corresponding to $\tilde g$ may be expressed as

(9.13) $\tilde H = \prod_{p} (1 - \tilde g(p))(1 - 1/p)^{-1} = HG^{-1}$,

with $G$ as in (9.8), so that (9.12) yields

(9.14) $\tilde H \tilde A(x) = HA(x)\{1 + O((\log x)^{-1})\}$.

Therefore the formula (1.17) for the sequence $\tilde{\mathcal{A}} = (\tilde a_n)$ coincides with that for
$\mathcal{A} = (a_n)$, as it has to. This completes the proof of Theorem 2.



10. A reduction of the bilinear form 



Our new axiom (B) in sieve theory can be made technically easier to
verify in practice if the involved integer variables have no small prime divisors.
Having this in mind, in this section we introduce an extra restriction into the
inner sum in (B), namely

(10.1) $(n, \Pi) = 1$

where $\Pi$ is the product of all primes $p < P$ and $P$ is given subject to

(10.2) $2 \le P \le \Delta^{1/(2^{35}\log\log x)}$.

Notice that this restriction is possible if $\Delta \ge (\log x)^{2^{35}}$. We assume in place of
(B) that

(B*) $\sum_{m} \Bigl|\sum^{*}_{N < n \le 2N,\ mn \le x} \gamma(n)\mu(mn)a_{mn}\Bigr| \le A(x)(\log x)^{-2^{26}}$,

where $*$ restricts the inner summation to numbers $n$ free of prime factors $p < P$.

Theorem 3. Replace (B) by (B*) in the assumptions of Theorem 2. Then 
(1.17) still holds. 

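For the proof it is enough to show that (B*) implies (B) with $\delta$, $\Delta$ replaced
by $2\delta$, $2\Delta^2$ respectively. Furthermore we only need to establish (B) in the
following integrated form

(∫B) $\int_N^{2N} \int_1^C \sum_{m} \Bigl|\sum_{w < n \le 2w} \gamma(n, t)\mu(mn)a_{mn}\Bigr| \frac{dt}{t}\frac{dw}{w} \ll A(x)(\log x)^{-2^{22}}$

for every $N$ with $\Delta^{-1}\sqrt{D} < N < \delta^{-1}\sqrt{x}$ in two cases $\gamma(n, t) = 1$ and

(10.3) $\gamma(n, t) = \sum_{d \mid n,\ d \le t} \mu(d)$.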
To see this recall that (B) was used solely to estimate the sums $S_2$ and $S_3$
in Sections 7 and 8. In the case of $S_2$ we note that (7.1) contains such an
integration with $y$ in place of $w$ and $\gamma(n, t) = 1$. In the case of $S_3$ we note that
(8.1) may be integrated over $z$ in place of $w$ and there $\Lambda^-(c)$ is the integral of
$\gamma(c, t)$ as required.

We write $n = n_0 n_1$ where $(n_0, \Pi) = 1$ and $n_1 \mid \Pi$. The contribution to
(∫B) from terms with $n_1 > \Delta$ is estimated trivially as

$W_1 \le (\log C) \sum\sum\sum_{m n_0 n_1 \le x,\ n_1 > \Delta} \tau(n_0 n_1)\,\mu^2(m n_0 n_1)\, a_{m n_0 n_1}$.

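Every such $n_1$ has a divisor $d$ with $\Delta < d \le \Delta P$, so

$W_1 \le (\log C) \sum_{d \mid \Pi,\ \Delta < d \le \Delta P}\ \sum_{d \mid n} a_n \tau_4(n)$.

By Lemma 1 for $k = 4$ we obtain $\tau_4(n) \le \tau(n)^3 \le (2\tau(\nu))^{24}$ for some $\nu \mid n$,
$\nu \le x^{1/4}$ and hence by (1.6)

$W_1 \le (\log C) \sum_{\Delta < d \le \Delta P}\ \sum_{(\nu,d)=1} (2\tau(\nu))^{24} A_{\nu d}(x) \ll A(x)G_1G_2\log x$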
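where $G_1$, $G_2$ are the following sums:

$G_1 = \sum_{\nu \le x^{1/4}} \nu^{-1}\tau(\nu)^{32} \ll (\log x)^{2^{32}}$,

$G_2 = \sum_{d \mid \Pi,\ d > \Delta} d^{-1}\tau(d)^8 \le \Delta^{-\varepsilon} \sum_{d \mid \Pi} d^{\varepsilon-1}\tau(d)^8 = \Delta^{-\varepsilon} \prod_{p<P} (1 + 2^8 p^{\varepsilon-1})$,

by Rankin's trick. We choose $\varepsilon = (\log P)^{-1}$ so that $p^{\varepsilon} < 8p^{-\varepsilon}$ for every $p \le P$
and obtain

$G_2 \le \Delta^{-\varepsilon} \prod_{p} (1 + 2^{11}p^{-1-\varepsilon}) \le \Delta^{-\varepsilon}\zeta(1+\varepsilon)^{2^{11}}$

$\ll \Delta^{-\varepsilon}\varepsilon^{-2^{11}} = \exp\Bigl(-\frac{\log\Delta}{\log P}\Bigr)(\log P)^{2^{11}} \ll (\log x)^{-2^{34}}$.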

Combining the above estimates we conclude that 
(10.4) Wi < A(x)(logx)- 2Xi . 

Next we estimate the contribution of terms with n\ < A. These give 
? 2N rC * At dw 



W 0^ / y y I 5^ 7(n ni,t)/i(mn ni)c 

JN Jl m _ I TT „.„ -0„„ 



"i ni|II to<noni^2u! 
In the case (10.3) we have



di\m 

Insert this and change variables $t \to td_1$, $w \to wn_1$ getting
^ 2JV rC_ * 



w o < / ^2n(m) \ ^2 l{n,t)n{mn)u t , 

Jn / A Jl m w<n^2w 

The same bound holds in the case $\gamma(n, t) = 1$, in fact in stronger form since
then the integration in $t$ is not needed and $\tau_3(m)$ can be replaced by $\tau(m)$. By
the argument that gave (B) $\Rightarrow$ (B') it follows that (B*) implies

^2 T 3( m ) I 7(n)fi(mn)a mn | < A(x)(log x) -2 ' 3 

m N<n^2N 

(the restriction $(n, \Pi) = 1$ only helps matters). Hence we conclude that

(10.5) $W_0 \ll A(x)(\log x)^{-2^{22}}$.

Adding (10.4) to (10.5) we obtain (∫B) completing the proof of Theorem 3.